Abstract 


We use ideas from distributed computing and game theory to study dynamic and decentralized 
environments in which computational nodes, or decision makers, interact strategically and with limited 
information. In such environments, which arise in many real-world settings, the participants act as both 
economic and computational entities. We exhibit a general non-convergence result for a broad class 
of dynamics in asynchronous settings. We consider implications of our result across a wide variety of 
interesting and timely applications: game dynamics, circuit design, social networks, Internet routing, 
and congestion control. We also study the computational and communication complexity of testing the 
convergence of asynchronous dynamics. Our work opens a new avenue for research at the intersection of 
distributed computing and game theory. 

1 Introduction 

Dynamic environments where decision makers repeatedly interact arise in a variety of settings, such as 
Internet protocols, large-scale markets, social networks, and multi-processor computer architectures. Study 
of these environments lies at the boundary of game theory and distributed computing. The decision makers 
are both strategic entities with individual economic preferences and computational entities with limited 
resources, working in a decentralized and uncertain environment. To understand the global behaviors that 
result from these interactions—the dynamics of these systems—we draw on ideas from both disciplines. 

The notion of self stabilization to a “legitimate” state in a distributed system parallels that of convergence 
to an equilibrium in a game. The foci, however, are very different. In game theory, there is extensive 
research on dynamics that result from what is perceived as natural strategic decision making (e.g., best- 
or better-response dynamics, fictitious play, or regret minimization). Even simple heuristics that require 
little information or computational resources can yield sophisticated behavior, such as the convergence 
of best-response dynamics to equilibrium points (see m and references therein). These positive results 
for simple game dynamics are, with few exceptions (see Section 2), based on the sometimes implicit and 
often unrealistic premise of a controlled environment in which actions are synchronous and coordinated. 
Distributed computing research emphasizes the environmental uncertainty that results from decentralization, 
but has no notion of “natural” rules of behavior. It has long been known that environmental uncertainty, in 
the form of both asynchrony [15] [30] and arbitrary initialization [10], introduces substantial difficulties for 
protocol termination in distributed systems. Our work bridges the gap between these two approaches by 
initiating the study of game dynamics in distributed computing settings. We take the first steps of this 
research agenda, focusing primarily on systems in which the decision makers, or computational nodes, are 
deterministic and have bounded recall, meaning that their behavior is based only on the “recent history” of 
system interaction. Our model is asynchronous in the sense of allowing, at every timestep, an adversarially 
chosen subset of nodes to be activated. 

Colgate University; N. Lutz, Department of Computer Science, Rutgers University, email: njlutz@rutgers.edu; M. Schapira, 
School of Computer Science and Engineering, Hebrew University of Jerusalem, email: schapiram@huji.ac.il; R. N. Wright, 
Department of Computer Science and DIMACS, Rutgers University, email: rebecca.wright@rutgers.edu. 

Our main contribution is a general impossibility result (Theorem 1) for asynchronous environments, 
showing that a large and natural class of bounded-recall dynamics can fail to converge whenever there 
are at least two “equilibrium points” of the dynamics. We prove this result using a valency argument 
(a now-standard technique in distributed computing theory mn]). We discuss the implications of this 
result for game dynamics and describe its applications to asynchronous circuit design, social networks, 
interdomain routing protocols such as BGP, and congestion control in networks. We also explore the impact 
on convergence guarantees of asynchrony that is bounded, and we present complexity hardness results for 
checking whether an asynchronous system will always converge: we show that it is PSPACE-hard and requires 
exponential communication. 


2 Related Work 

Our work relates to many ideas in game theory and in distributed computing. Here, we discuss game- 
theoretic work on the dynamics of simple strategies and on asynchrony, distributed computing work on fault 
tolerance and self stabilization, and other connections between game theory and computer science (for more, 
see [IH]). We also highlight the application areas we consider. 

Algorithmic game theory. Since our work draws on both game theory and computer science, it may be 
considered part of the broader research program of algorithmic game theory (AGT), which merges concepts 
and ideas from those two fields [35]. Three main areas of study in AGT have been algorithmic mechanism 
design, which applies concepts from computer science to economic mechanism design [34]; the “price of anarchy,” 
which describes the efficiency of equilibria and draws on approximability research EH; and algorithmic 
and complexity research on the computation of equilibria [36]. Analyzing the computational power of learning 
dynamics in games has been of particular interest (see, e.g., [g ESI El El]). Our work creates another link 
between game theory and computer science by drawing on two previously disjoint areas, self-stabilization 
in distributed computing theory and game dynamics, to explore broader classes of dynamics operating in 
adversarial distributed environments. 

Adaptive heuristics. Much work in game theory and economics deals with adaptive heuristics (see m 
and references therein). Generally speaking, this long line of research explores the “convergence” of simple 
and myopic rules of behavior (e.g., best-response/fictitious-play/no-regret dynamics) to an “equilibrium”. 
However, with few exceptions (see below), such analysis has so far primarily concentrated on synchronous 
environments in which steps take place simultaneously or in some other predetermined order. In this work, we 
explore dynamics of this type in asynchronous environments, which are more realistic for many applications. 

Game-theoretic work on asynchronous environments. Some game-theoretic work on repeated games 
considers “asynchronous moves.” Often, as in m, this asynchrony merely indicates that players are not all 
activated at each time step, and thus is used to describe environments where only one player is activated at a 
time (“alternating moves”), or where there is a probability distribution that determines which player(s) are 
activated at each timestep. Other work does not explore the behavior of dynamics, but has other research 
goals (e.g., characterizing equilibria, establishing folk theorems); see [2g|42], among others, and references 
therein. To the best of our knowledge, we are the first to study the effects of asynchrony (in the broad 
distributed computing sense) on the convergence of game dynamics to equilibria. 

Fault-tolerant computation. We use ideas and techniques from work in distributed computing on protocol 
termination in asynchronous computational environments where nodes and communication channels are 
possibly faulty. Protocol termination in such environments, initially motivated by multi-processor computer 
architectures, has been extensively studied in the past three decades [HilliillESllsg], as nicely surveyed 
in [301 [M]. Fischer, Lynch and Paterson m showed, in a landmark paper, that a broad class of failure- 
resilient consensus protocols cannot provably terminate. Intuitively, the risk of protocol non-termination in 
that work stems from the possibility of failures; a computational node cannot tell whether another node is 
silent due to a failure or is simply taking a long time to react. Our non-convergence result, by contrast, 
applies to failure-free environments. In game-theoretic work that incorporated fault tolerance concepts, 
Abraham et al. [1] studied equilibria that are robust to defection and collusion. 

Self stabilization. The concept of self stabilization is fundamental to distributed computing and dates 
back to Dijkstra 1974 [S] (see [T^ and references therein). Convergence of dynamics to an “equilibrium” in 
our model can be viewed as the self stabilization of such dynamics (where the “equilibrium points” are the 
legitimate configurations). Our formulation draws ideas from work in distributed computing (e.g., Burns’ 
distributed daemon model 0) and in networking research m on self stabilization. 

Applications. We discuss the implications of our non-convergence result across a wide variety of applications 
that have previously been studied: convergence of game dynamics (see, e.g., mi HD); asynchronous 
circuits (see, e.g., 0); diffusion of innovations, behaviors, etc., in social networks (see [33] [24]); interdomain 
routing [ITl [40]; and congestion control m. 


3 Asynchronous Dynamic Interaction 

In this section we present our model of asynchronous dynamic interaction. Intuitively, an interaction system 
consists of a collection of computational nodes, each capable of selecting actions that are visible to the 
other nodes. The state of the system at any time consists of each node’s current action. Each node has a 
deterministic reaction function that maps system histories to actions. At every discrete timestep, each node 
activated by a schedule simultaneously applies its deterministic reaction function to select a new action, 
which is immediately visible to all other nodes. 

Definition An interaction system is characterized by a tuple $(n, A, f)$: 

• The system has $n \in \mathbb{Z}_+$ computational nodes, labeled $1, \ldots, n$. 

• $A = A_1 \times \cdots \times A_n$, where each $A_i$ is a finite set called the action space of node $i$. $A$ is called the 
state space of the system, and a state is an $n$-tuple $a = (a_1, \ldots, a_n) \in A$. A history of the system is a 
nonempty finite sequence of states, $H \in A^\ell$, for some $\ell \in \mathbb{Z}_+$. The set of all histories is $A^+ = \bigcup_{\ell \in \mathbb{Z}_+} A^\ell$. 

• $f : A^+ \to A$ is a function given by $f(H) = (f_1(H), \ldots, f_n(H))$, where $f_i : A^+ \to A_i$ is called node $i$’s 
reaction function. 


We now describe the asynchronous dynamics of our model, i.e., the ways that a system’s state can evolve 
due to interactions between nodes. Informally, there is some initial state, and, in each discrete time step 
1,2,3,..., a subset of the nodes are activated according to a schedule. The nodes that are activated in a 
given timestep react simultaneously; each applies its reaction function to the current state to choose a new 
action. This updated action is immediately observable to all other nodes. 

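Definition Let $S \subseteq [n]$ be a set of nodes. Define the function $f_S : A^+ \to A$ by $f_S(H) = (\hat{f}_1(H), \ldots, \hat{f}_n(H))$, 
where each function $\hat{f}_i : A^+ \to A_i$ is given by 

$$\hat{f}_i(a^0, \ldots, a^\ell) = \begin{cases} f_i(a^0, \ldots, a^\ell) & \text{if } i \in S \\ a_i^\ell & \text{otherwise.} \end{cases}$$

A schedule is a function $\sigma : \mathbb{Z}_+ \to 2^{[n]}$ that maps each $t$ to a (possibly empty) subset of the computational 
nodes. If $i \in \sigma(t)$, then we say that node $i$ is activated at time $t$. 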


Since the reaction functions are deterministic, an initial history and a schedule completely determine the 
resulting infinite state sequence; we call these state sequences trajectories. 

^This model has “perfect monitoring.” While this is clearly unrealistic in some important real-life contexts (e.g., some of the 
environments considered in Section 5), this restriction only strengthens our main results, which are impossibility results. 

^$[n]$ denotes $\{1, \ldots, n\}$, and for any set $S$, $2^S$ is the set of all subsets of $S$. 

Definition Let $H = (a^0, \ldots, a^\ell) \in A^{\ell+1}$ be a history, and let $\sigma$ be a schedule. The $(H, \sigma)$-trajectory of the 
system is the infinite sequence $a^0, a^1, a^2, \ldots$ extending $H$ such that for every $t > \ell$, 

$$a^t = f_{\sigma(t)}(a^0, \ldots, a^{t-1}).$$

The history $(a^0, \ldots, a^{t-1})$ is the length-$t$ prefix of the $(H, \sigma)$-trajectory. 
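
As an illustration of the definitions above, the following minimal Python sketch generates a finite prefix of an $(H, \sigma)$-trajectory. The names `step` and `trajectory`, the 0-indexed node labels, and the two-node "copy the other node" system are assumptions made for illustration only; they are not part of the formal model.

```python
from typing import Callable, FrozenSet, List, Sequence, Tuple

State = Tuple[int, ...]
History = Sequence[State]
Reaction = Callable[[History], int]         # f_i : A^+ -> A_i
Schedule = Callable[[int], FrozenSet[int]]  # sigma : Z_+ -> 2^[n] (nodes 0-indexed here)


def step(reactions: Sequence[Reaction], history: History, active: FrozenSet[int]) -> State:
    """Compute f_S(H): activated nodes react to H, all others repeat their last action."""
    last = history[-1]
    return tuple(
        reactions[i](history) if i in active else last[i]
        for i in range(len(last))
    )


def trajectory(reactions: Sequence[Reaction], history: History,
               schedule: Schedule, steps: int) -> List[State]:
    """Return `history` extended by `steps` further states of the (H, sigma)-trajectory."""
    states = list(history)
    for _ in range(steps):
        t = len(states)  # index of the state being produced: a^t = f_{sigma(t)}(a^0..a^{t-1})
        states.append(step(reactions, states, schedule(t)))
    return states


# Hypothetical example: each of two nodes copies the other's most recent action.
copy_other = [lambda h: h[-1][1], lambda h: h[-1][0]]
alternating = lambda t: frozenset({t % 2})  # node 1 at odd t, node 0 at even t

if __name__ == "__main__":
    print(trajectory(copy_other, [(0, 1)], alternating, 4))
```

Only a finite prefix can be computed this way, so the sketch can exhibit a trajectory but cannot by itself certify convergence.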

3.1 Fairness and Convergence 

Our main theorem is an impossibility result, in which we show that an adversarially chosen initial history 
and schedule can prevent desirable system behavior. Notice that an arbitrary schedule might never allow 
some or all nodes to react, or might stop activating them after some time. Hence, we limit this adversarial 
power (thereby strengthening our impossibility result) by restricting our attention to fair schedules, which 
never permanently stop activating any node. 

Definition A fair schedule is a schedule $\sigma$ that activates each node infinitely many times, i.e., for each 
$i \in [n]$, the set $\{t \in \mathbb{Z}_+ : i \in \sigma(t)\}$ is infinite. A fair trajectory is one that is the $(H, \sigma)$-trajectory for some 
history $H$ and some fair schedule $\sigma$. 

We are especially interested in whether a system’s fair trajectories converge, eventually remaining at a 
single state forever. 

Definition A trajectory $a^0, a^1, a^2, \ldots$ converges to a state $b$ if there exists some $T \in \mathbb{Z}_+$ such that, for all 
$t > T$, $a^t = b$. The system is convergent if every fair trajectory converges. A state $b$ is a limit state of the 
system if some fair trajectory converges to $b$. 
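
For one special case, convergence of a single trajectory can be decided exactly. The sketch below assumes reaction functions that depend only on the current state and a periodic schedule given as a list of activation sets; both restrictions, the 0-indexed nodes, and the function names are assumptions of this illustration. Under these assumptions the pair (state, phase of the schedule) evolves deterministically in a finite set, so the trajectory eventually cycles, and it converges exactly when that cycle contains a single state.

```python
from typing import Callable, Dict, List, Optional, Sequence, Set, Tuple

State = Tuple[int, ...]


def is_fair_periodic(period: Sequence[Set[int]], n: int) -> bool:
    """A periodic schedule is fair iff every node appears somewhere in one period."""
    return set().union(*period) == set(range(n))


def limit_under_periodic_schedule(
    reactions: Sequence[Callable[[State], int]],
    start: State,
    period: Sequence[Set[int]],
) -> Optional[State]:
    """Return the limit state of the trajectory, or None if it does not converge."""
    p = len(period)
    seen: Dict[Tuple[State, int], int] = {}
    trace: List[State] = []
    state, t = start, 0
    while (state, t % p) not in seen:
        seen[(state, t % p)] = len(trace)
        trace.append(state)
        active = period[t % p]
        state = tuple(
            reactions[i](state) if i in active else state[i]
            for i in range(len(state))
        )
        t += 1
    cycle = trace[seen[(state, t % p)]:]  # the states repeated forever
    return cycle[0] if all(s == cycle[0] for s in cycle) else None


# Hypothetical two-node system in which each node copies the other.
copy_other = [lambda s: s[1], lambda s: s[0]]

if __name__ == "__main__":
    print(limit_under_periodic_schedule(copy_other, (0, 1), [{0}, {1}]))
    print(limit_under_periodic_schedule(copy_other, (0, 1), [{0, 1}]))
```

Both schedules in the usage lines are fair, yet only the first yields a convergent trajectory from the state (0, 1).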

Note that it is possible for a trajectory to visit a limit state without converging to that state, meaning 
that limit states are not necessarily “stable” or “absorbing.” They may, however, have basins of attraction, 
in the sense that reaching certain histories might guarantee convergence to a given limit state. 

Definition A history $H$ is committed to a limit state $b$ if, for every fair schedule $\sigma$, the $(H, \sigma)$-trajectory 
converges to $b$. An uncommitted history is one that is not committed to any state. 

3.2 Informational Restrictions on Reaction Functions 

This framework allows for very powerful reaction functions. We now present several possible restrictions on 
the information they may use. These are illustrated in Fig. 1. 

Our main theorem concerns systems in which the reaction functions are self-independent, meaning that 
each node ignores its own past and present actions when reacting to the system’s state. In discussing self 
independence, we use the notation 

$$A_{-i} = A_1 \times \cdots \times A_{i-1} \times A_{i+1} \times \cdots \times A_n,$$

the state space of the system when $i$ is ignored. Similarly, for a state $a$, $a_{-i} \in A_{-i}$ denotes 

$$(a_1, \ldots, a_{i-1}, a_{i+1}, \ldots, a_n),$$

and given a history $H = (a^0, \ldots, a^{\ell-1})$, we write $H_{-i}$ for $(a^0_{-i}, \ldots, a^{\ell-1}_{-i})$. Using this notation, we formally 
define self independence. 

Definition A reaction function $f_i$ is self-independent if there exists a function $g_i : A_{-i}^+ \to A_i$ such that 
$f_i(H) = g_i(H_{-i})$ for every history $H \in A^+$. 

A reaction function has bounded recall if it only depends on recent states. 

Definition Given $k \in \mathbb{Z}_+$ and a history $H = (a^0, \ldots, a^{t-1}) \in A^t$ with $t \geq k$, the $k$-history at $H$ is 
$H|_k := (a^{t-k}, \ldots, a^{t-1})$, the $k$-tuple of most recent states. A reaction function $f_i$ has $k$-recall if it only 
depends on the $k$-history and the time counter, i.e., there exists a function $g_i : A^k \times \mathbb{Z}_+ \to A_i$ such that 
$f_i(H) = g_i(H|_k, t)$ for every time $t \geq k$ and history $H \in A^t$. 

We sometimes slightly abuse notation by referring to the restricted-domain function $g_i$, rather than $f_i$, as 
the node’s reaction function. 

A bounded-recall reaction function is stationary if it also ignores the time counter. 

Definition We say that a $k$-recall reaction function is stationary if the time counter $t$ is of no importance, 
that is, if there exists a function $g_i : A^k \to A_i$ such that $f_i(H) = g_i(H|_k)$ for every time $t \geq k$ and history 
$H \in A^t$. A reaction function $f_i$ is historyless if $f_i$ is both 1-recall and stationary, that is, if $f_i$ only depends 
on the nodes’ most recent actions. 
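
The restrictions defined in this subsection can be pictured as wrappers that hide part of the history from a node. The sketch below is a loose illustration: the helper names (`k_history`, `drop_self`, `lift_historyless_self_independent`), the tuple encoding of states, and the 0-indexed nodes are all assumptions, not notation from the model.

```python
from typing import Callable, Sequence, Tuple

State = Tuple[int, ...]


def k_history(history: Sequence[State], k: int) -> Tuple[State, ...]:
    """H|_k: the k most recent states of a history of length at least k."""
    if len(history) < k:
        raise ValueError("history is shorter than k")
    return tuple(history[-k:])


def drop_self(state: State, i: int) -> State:
    """a_{-i}: the state with node i's own action removed."""
    return state[:i] + state[i + 1:]


def lift_historyless_self_independent(
    g: Callable[[State], int], i: int
) -> Callable[[Sequence[State]], int]:
    """Turn g : A_{-i} -> A_i into a reaction function f_i : A^+ -> A_i.

    The resulting f_i sees only the other nodes' most recent actions, so it is
    self-independent, 1-recall and stationary (i.e., historyless).
    """
    return lambda history: g(drop_self(history[-1], i))


if __name__ == "__main__":
    f0 = lift_historyless_self_independent(lambda others: max(others), 0)
    print(f0([(9, 1, 2), (9, 3, 4)]))
```

In the usage line, node 0's own action (9) never influences its choice, which is the defining property of self independence.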


While seemingly very restricted, historyless dynamics capture the prominent and extensively studied best-
response dynamics from game theory (as we discuss in Section 5.1). We show in Section 5 that historyless 
dynamics also encompass a host of other applications of interest, ranging from Internet protocols to the 
adoption of technologies in social networks. 


Figure 1: Shading shows the information about past and current actions available to node 3 at time t given 
different reaction function restrictions. Left: self-independent. Node 3 can see the entire record of other 
nodes’ past actions, but not its own. The length of this record gives the current timestamp t. Center: 
2-recall. Node 3 can see only the two most recent states. Unless the reaction function is stationary, it may 
also use the value of the current timestamp. Right: self-independent and historyless. Node 3 can only see 
other nodes’ most recent actions and cannot even see the value of the current timestamp. 


4 General Non-convergence Result 

We now present our main theorem, a general impossibility result for convergence of nodes’ actions under 
bounded-recall dynamics in asynchronous, distributed computational environments. 

Theorem 1 (Main theorem). In an interaction system where every reaction function is self-independent 
and has bounded recall, the existence of multiple limit states implies that the system is not convergent. 
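
For very small systems whose reaction functions are historyless, the statement of the theorem can be checked by brute force. The sketch below is not the valency argument used in the proof; it relies on an observation of ours that holds for historyless systems: a non-convergent fair trajectory exists exactly when some strongly connected set of at least two states has internal transitions that, taken together, activate every node. The function name, the encoding of states as tuples, and the example systems are assumptions for illustration.

```python
from itertools import combinations, product


def is_convergent(action_sets, reactions):
    """Decide convergence of a historyless system by exhaustive search.

    action_sets[i] lists node i's actions; reactions[i](state) returns node i's
    next action given the current state (nodes are 0-indexed here).
    """
    n = len(action_sets)
    states = list(product(*action_sets))
    subsets = [frozenset(c) for r in range(1, n + 1) for c in combinations(range(n), r)]

    # Labeled transition graph: activating S at state s leads to nxt[s][S].
    nxt = {
        s: {S: tuple(reactions[i](s) if i in S else s[i] for i in range(n)) for S in subsets}
        for s in states
    }

    def reachable(src):
        seen, stack = {src}, [src]
        while stack:
            for dst in nxt[stack.pop()].values():
                if dst not in seen:
                    seen.add(dst)
                    stack.append(dst)
        return seen

    reach = {s: reachable(s) for s in states}
    for s in states:
        scc = {u for u in reach[s] if s in reach[u]}  # strongly connected component of s
        if len(scc) < 2:
            continue
        covered = set()
        for u in scc:
            for S, v in nxt[u].items():
                if v in scc:
                    covered |= S
        if covered == set(range(n)):
            return False  # a fair schedule can cycle inside this component forever
    return True


if __name__ == "__main__":
    # Two nodes that copy each other: self-independent, historyless,
    # with limit states (0, 0) and (1, 1).
    copy_other = [lambda s: s[1], lambda s: s[0]]
    print(is_convergent([(0, 1), (0, 1)], copy_other))
```

The search enumerates every state and every activation set, so it is only practical for toy examples.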

We prove this theorem in Section 4.1 by using a valency argument. In Section 4.2 we show that the 
hypotheses of Theorem 1 are necessary. We then discuss in Section 4.3 the connections of this work to the 
famous result of Fischer et al. m on the impossibility of resilient consensus. 

Note that system convergence is closely related to self stabilization, which is a guarantee that the system 
will reach and remain within a set of legitimate states. For a set $L \subseteq A$, we say that a system 
self-stabilizes to $L$ if, for every fair trajectory $a^0, a^1, a^2, \ldots$, there is some $T \in \mathbb{Z}_+$ such that, for every $t > T$, 
$a^t \in L$. In systems satisfying its hypotheses, Theorem 1 precludes self stabilization to any set containing 
only committed states. 

4.1 Proof of Main Theorem 

In proving this theorem, we use the following sequence of lemmas. We first show in Lemma 1 that it is 
sufficient to consider systems with 1-recall reaction functions. Then in Lemma 2 we argue that such a 
system can be convergent only if every fair trajectory has a committed prefix. To show the existence of 
a fair trajectory with no committed prefix, we show that in any such system, uncommitted histories exist 
(Lemma 3) and can be extended to longer uncommitted histories in a way that activates any given node 
(Lemma 4). This means that committed prefixes can be avoided forever on a trajectory that activates every 
node infinitely many times, i.e., a fair trajectory. 

Lemma 1. If there exists a convergent interaction system with bounded-recall, self-independent reaction 
functions and multiple limit states, then there is also a convergent interaction system with 1-recall, 
self-independent reaction functions and multiple limit states. 

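Proof. Assume that $\Gamma = (n, A, f)$ is a convergent system with self-independent, $k$-recall reaction functions and 
multiple limit states, for some $k \in \mathbb{Z}_+$. Consider a 1-recall system $\Gamma' = (n, A', f')$, where $A' = A_1^k \times \cdots \times A_n^k$ 
and $f' : A' \times \mathbb{Z}_+ \to A'$ is given by 

$$f'_i\big((a_1^1, \ldots, a_1^k), \ldots, (a_n^1, \ldots, a_n^k),\, t\big) = \big[f_i\big((a_1^1, \ldots, a_n^1), \ldots, (a_1^k, \ldots, a_n^k),\, kt\big)\big]^k,$$

where $[x]^k$ denotes the $k$-tuple $(x, \ldots, x)$. 

Informally, a state in $\Gamma'$ is the transpose of a $k$-history for $\Gamma$. The reaction function $f'_i$ applies $f_i$ to 
this transpose and repeats the output $k$ times. Notice that $\Gamma'$ has self-independent reaction functions. 
Furthermore, if $(a_1, \ldots, a_n)$ is a limit state of $\Gamma$, then $((a_1, \ldots, a_1), \ldots, (a_n, \ldots, a_n))$ is a limit state of $\Gamma'$, so 
$\Gamma'$ also has multiple limit states. 

Let $\sigma : \mathbb{Z}_+ \to 2^{[n]}$ be a fair schedule, and let $H \in (A')^\ell$ be a history of $\Gamma'$ for some $\ell \in \mathbb{Z}_+$. Define the 
schedule $\sigma' : \mathbb{Z}_+ \to 2^{[n]}$ by 

$$\sigma'(t) = \begin{cases} \sigma(t/k) & \text{if } t/k \in \mathbb{Z}_+ \\ \emptyset & \text{otherwise.} \end{cases}$$

Notice that $\sigma'$ is also fair. Let $H' \in A^{k\ell}$ be the history for $\Gamma$ formed by concatenating the $k$-tuples in $H$. It 
is easy to see that the $(H, \sigma)$-trajectory of $\Gamma'$ converges if and only if the $(H', \sigma')$-trajectory of $\Gamma$ converges. 
Since we assumed that $\Gamma$ is convergent, it follows that $\Gamma'$ is also. □ 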
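
The two bookkeeping devices in this proof, slowing the schedule down by a factor of $k$ and transposing a $k$-history into one state of the 1-recall system, can be sketched in a few lines of Python. The function names and the tuple encoding are illustrative assumptions; the sketch covers only these two devices, not the full reduction.

```python
from typing import Callable, FrozenSet, Tuple


def stretch_schedule(sigma: Callable[[int], FrozenSet[int]], k: int) -> Callable[[int], FrozenSet[int]]:
    """Build sigma' with sigma'(t) = sigma(t / k) when k divides t, and the empty set otherwise."""
    def sigma_prime(t: int) -> FrozenSet[int]:
        return sigma(t // k) if t % k == 0 else frozenset()
    return sigma_prime


def transpose(history_window: Tuple[Tuple[int, ...], ...]) -> Tuple[Tuple[int, ...], ...]:
    """Turn a k-history (k states of n actions) into n per-node k-tuples."""
    return tuple(zip(*history_window))


if __name__ == "__main__":
    sigma = lambda t: frozenset({t % 2})   # a hypothetical fair schedule on two nodes
    sigma_prime = stretch_schedule(sigma, 3)
    print([sorted(sigma_prime(t)) for t in range(1, 7)])
    print(transpose(((1, 2), (3, 4), (5, 6))))
```

Because every activation of the original schedule reappears at a multiple of $k$, the stretched schedule activates each node infinitely often whenever the original does.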


Lemma 2. Let $\Gamma$ be a convergent system with self-independent, 1-recall reaction functions. Then every fair 
trajectory in $\Gamma$ has a committed finite prefix. 

Proof. Assume there exist some history $H$ and fair schedule $\sigma$ for $\Gamma$ such that the $(H, \sigma)$-trajectory converges 
to a state $a = (a_1, \ldots, a_n)$ but has no committed finite prefix. We will construct a fair schedule $\sigma'$ such that 
the $(H, \sigma')$-trajectory does not converge, giving a contradiction. 

Let $u^0, u^1, u^2, \ldots$ be the $(H, \sigma)$-trajectory. Then there is some $t_0 \in \mathbb{Z}_+$ such that $u^t = a$ for all $t \geq t_0$. 
The fairness of $\sigma$ implies that there is some $t_1 > t_0$ such that every node is activated by $\sigma$ between $t_0$ and 
$t_1$, i.e., $\bigcup_{t_0 < t \leq t_1} \sigma(t) = [n]$. By assumption, $(u^0, \ldots, u^{t_1})$ is not committed to $a$, which means there is some 
time $t_2 > t_1$ and node $i \in [n]$ such that $f_i(a, t_2) \neq a_i$. The fairness of $\sigma$ also implies that there is some $t_3 > t_2$ 
such that $i \in \sigma(t_3)$. Since $t_3 > t_0$, we must have $f_i(a, t_3) = a_i$. By self-independence, then, $f_i(a', t_3) = a_i$ 
for all $a'$ such that $a'_{-i} = a_{-i}$. 

We use these facts to iteratively build our fair schedule $\sigma'$. In the $(H, \sigma')$-trajectory $v^0, v^1, v^2, \ldots$, the 
system will repeatedly enter and exit the state $a$. First, let $\sigma'(t) = \sigma(t)$ for all $1 \leq t \leq t_0$, so that $v^{t_0} = a$. 
Define $\sigma'(t)$ on values $t_0 < t \leq t_3$ as follows. 

$$\sigma'(t) = \begin{cases} \sigma(t) & \text{if } t_0 < t < t_2 \\ \{i\} & \text{if } t_2 \leq t \leq t_3. \end{cases}$$

By our choices of $t_1$, $t_2$, and $t_3$, this partial schedule activates every node and induces a segment 

$$v^{t_0}, v^{t_0+1}, \ldots, v^{t_3}$$

of the $(H, \sigma')$-trajectory such that $v^t = a$ whenever $t_0 \leq t < t_2$ or $t = t_3$, but $v^{t_2} \neq a$. 

Now set $t_0 = t_3$, select new $t_1$, $t_2$, $i$, and $t_3$ relative to this $t_0$, and iterate this process to define $\sigma'(t)$ for all 
values $t \in \mathbb{Z}_+$. Notice that $\sigma'$ is fair and that the $(H, \sigma')$-trajectory $v^0, v^1, v^2, \ldots$ does not converge, which 
contradicts the assumption that the system is convergent. Therefore every fair trajectory in the system must 
have a committed finite prefix. □ 

We will use the following consequence of self independence in the course of proving Lemmas 3 and 4. 

Observation 2. Let $H = (a^0, \ldots, a^\ell)$ and $H' = (b^0, \ldots, b^\ell)$ be committed histories in a system with 
self-independent, 1-recall reaction functions. If $a^\ell_{-i} = b^\ell_{-i}$ for some $i \in [n]$, then $H$ and $H'$ are committed 
to the same limit state. 

Proof. Let $H = (a^0, \ldots, a^\ell)$ and $H' = (b^0, \ldots, b^\ell)$ be committed histories such that, for some $i \in [n]$, 
$a^\ell_{-i} = b^\ell_{-i}$, as in Fig. 2. Let $\sigma$ be any fair schedule such that $\sigma(\ell + 1) = \{i\}$, and consider the $(H, \sigma)$- 
and $(H', \sigma)$-trajectories. When node $i$ is activated, it will choose the same action regardless of whether the 
history is $H$ or $H'$, by self independence. As the reaction functions have 1-recall, this means that both 
these trajectories are identical after time $\ell + 1$. Thus, since $H$ and $H'$ are both committed, they must be 
committed to the same limit state. □ 






Figure 2: Activating $\{i\}$ from $H = (a^0, \ldots, a^\ell)$ or $H' = (b^0, \ldots, b^\ell)$ will have the same outcome. 

Lemma 3. Every interaction system with 1-recall, self-independent reaction functions and more than one 
limit state has at least one uncommitted history. 

Proof. Suppose that every history of length one is committed, and consider two such histories $(a) = ((a_1, \ldots, a_n))$ 
and $(b) = ((b_1, \ldots, b_n))$. Observation 2 implies that, for all $1 \leq i \leq n$, the histories 
$((a_1, \ldots, a_{i-1}, b_i, \ldots, b_n))$ and $((a_1, \ldots, a_i, b_{i+1}, \ldots, b_n))$ are committed to the same limit state, and therefore 
that $(a)$ and $(b)$ are committed to the same limit state. Thus, all histories of length one must be committed 
to the same limit state, and it follows that all histories must be committed to the same limit state. This 
contradicts the system having more than one limit state. □ 

Lemma 4. Let $(n, A, f)$ be an interaction system with self-independent, 1-recall reaction functions and more 
than one limit state, let $H = (a^0, \ldots, a^{\ell-1}) \in A^\ell$ be an uncommitted history, for some $\ell \in \mathbb{Z}_+$, and let $i \in [n]$ 
be a node. Then there exist some $t \geq \ell$ and schedule $\sigma$ such that $i \in \sigma(t)$ and the length-$(t+1)$ prefix of the 
$(H, \sigma)$-trajectory is uncommitted. 

Proof. Assume for contradiction that no such $t$ and $\sigma$ exist. Consider all histories that result from activating 
a set containing $i$ at history $H$. By assumption, each of these histories is committed. Notice that for all 
$S \subseteq [n]$ and $j \in [n]$, the states $f_S(H)$ and $f_{S \cup \{j\}}(H)$ can only differ at coordinate $j$. Hence, we can iteratively 
apply Observation 2 much as in the proof of Lemma 3 to see that all these histories must be committed to 
the same limit state, which we call $b$. 

Let $\sigma$ be any fair schedule, and let $a^0, a^1, \ldots$ be the $(H, \sigma)$-trajectory. For each $t \in \mathbb{Z}_+$, let $H^t = 
(a^0, \ldots, a^t)$, and notice that $H^{\ell-1} = H$. For each $t \geq \ell$, let $v^t = f_{\sigma(t) \cup \{i\}}(H^{t-1})$ and let $I^t = (H^{t-1}, v^t)$. 
Since $i \in \sigma(t) \cup \{i\}$, our assumption implies that each history $I^t$ is committed. Let $w^t = f_{\{i\}}(H^{t-1})$ and 
note that by self independence, $f_{\{i\}}(I^{t-1}) = w^t$ also, as illustrated in Fig. 3. Let $J^t = (I^{t-1}, w^t)$. 

We now show by induction on $t$ that, for every $t \geq \ell$, the history $I^t$ is committed to $b$. This holds for 
$t = \ell$ by our definition of $b$. Fix $t > \ell$, and suppose that $I^{t-1}$ is committed to $b$. Then $J^t$ is also committed 
to $b$. Consider all histories that result from activating a set containing $i$ at history $H^{t-1}$. As before, our 
assumption implies that all these histories are committed, and iterative application of Observation 2 shows 
that they are all committed to the same limit state. In particular, $I^t$ must be committed to the same limit 
state as $J^t$, namely $b$. 

Since $\sigma$ is a fair schedule, there is some time $t \geq \ell$ for which $i \in \sigma(t)$. For this $t$, we have $H^t = I^t$, so 
$H^t$ is committed to $b$. Thus for every fair schedule $\sigma$, the $(H, \sigma)$-trajectory converges to $b$, contradicting 
the assumption that $H$ is uncommitted. We conclude that our assumption was false and that the lemma 
holds. □ 

Proof of Theorem 1. It follows from Lemmas 3 and 4 that, in every system with 1-recall, self-independent 
reaction functions and multiple limit states, it is possible to activate each node infinitely many times 
without ever reaching a committed history. This means that every such system has a fair trajectory with no 
committed prefix. By Lemma 2, this implies that no such system may be convergent. The theorem follows 
immediately by Lemma 1. □ 

Figure 3: All histories in the bottom row are committed to the same limit state b. 


4.2 Tightness of Main Theorem 

The following two examples demonstrate that the statement of Theorem 1 does not hold if either the 
self-independence restriction or the bounded-recall restriction is removed. 

Example The self-independence restriction cannot be removed. 

Consider a system with one node, with action space $\{\alpha, \beta\}$. When activated, the node always re-selects its 
own current action. Observe that the system is convergent despite having two limit states. 

Example The bounded-recall restriction cannot be removed. 

Consider a system with two nodes, 1 and 2, each with the action space $\{\alpha, \beta\}$. The self-independent reaction 
functions of the nodes are as follows: node 2 always chooses node 1’s action; node 1 will choose $\beta$ if node 
2’s action changed from $\alpha$ to $\beta$ in the past, and $\alpha$ otherwise. Observe that node 1’s reaction function has 
unbounded recall: it depends on the entire history of interaction. We observe that the system 
is convergent and has two limit states. If node 1 chooses $\beta$ at some point in time due to the 
fact that node 2’s action changed from $\alpha$ to $\beta$, then it will continue to do so thereafter; if, on the other hand, 
node 1 never does so, then from some point in time onwards, node 1’s action is constantly $\alpha$. In both cases, node 
2 will have the same action as node 1 eventually, and thus convergence to one of the two limit states, $(\alpha, \alpha)$ 
and $(\beta, \beta)$, is guaranteed. Hence, two limit states exist and the system is convergent nonetheless. Notice 
also that node 1’s reaction function requires only two states, so the bounded-recall restriction cannot be 
replaced by a memory restriction. 
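
The second example can be explored with a small simulation. The sketch below encodes node 1 and node 2 as described above (indexed 0 and 1 in the code) and runs them from every one-state history under a handful of periodic fair schedules. The choice of schedules, the 20-step horizon, and the helper names are assumptions of this illustration; a finite experiment of this kind is consistent with convergence but does not prove it.

```python
ALPHA, BETA = "alpha", "beta"


def node1(history):
    """Choose beta iff node 2's action ever changed from alpha to beta (unbounded recall)."""
    theirs = [state[1] for state in history]
    switched = any(x == ALPHA and y == BETA for x, y in zip(theirs, theirs[1:]))
    return BETA if switched else ALPHA


def node2(history):
    """Copy node 1's most recent action."""
    return history[-1][0]


def run(start, period, steps):
    """Extend the one-state history [start] under a periodic schedule (a list of index sets)."""
    reactions = (node1, node2)
    history = [start]
    for t in range(steps):
        active = period[t % len(period)]
        last = history[-1]
        history.append(tuple(
            reactions[i](history) if i in active else last[i] for i in range(2)
        ))
    return history


if __name__ == "__main__":
    # Node 2 moves first and copies beta; node 1 then sees the alpha-to-beta switch.
    print(run((BETA, ALPHA), [{1}, {0}], 6)[-1])
    # Node 1 moves first and resets to alpha; node 2 then copies it.
    print(run((BETA, ALPHA), [{0}, {1}], 6)[-1])
```

The two usage lines start from the same state and differ only in the order of activation, which is enough to steer the system to different limit states.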


4.3 Connection to Consensus Protocols 

We now discuss the relationship of our main result (Theorem 1) to the seminal result of Fischer et al. on 
the impossibility of fault-resilient consensus protocols. The consensus problem is fundamental to distributed 
computing research. We give a brief description of it here, and we refer the reader to [T^ for a detailed 
explanation of the model. 

Fischer et al. studied an environment in which a group of processes, each with an initial value in {0,1}, 
communicate with each other via messages. The objective is for all non-faulty processes to eventually agree 
on some consensus value $x \in \{0,1\}$, where $x$ must match the initial value of some process. Fischer et al. 
established that no consensus protocol is resilient to even a single failure. Their proof of this breakthrough 
non-termination result introduced the idea of a valency argument. They showed that there exists some initial 
configuration that is bivalent, meaning that the resulting consensus could be either 0 or 1 (the outcome 
depends on the asynchronous schedule of message transmission), and that this bivalence can be maintained. 
Our proof of Theorem 1 also uses a valency argument, where uncommitted histories play the role of bivalent 
configurations. 

^Although our primary focus is on discrete state spaces, we note that in continuous metric spaces, the standard notion of 
convergence only requires indefinite approach; the limit point might never be reached. Accordingly, given a metric $d$ on an 
infinite state space $A$, one could modify the definition of convergence in Section 3.1 to say that a trajectory $a^0, a^1, a^2, \ldots$ 
converges to a state $b$ if for every $\epsilon > 0$ there exists some $T \in \mathbb{Z}_+$ such that, for all $t > T$, $d(a^t, b) < \epsilon$. 
If we require every limit state to have a committed 
neighborhood, then our proof of Theorem 1 still holds in this setting. 

Intuitively, the risk of protocol non-termination in the environment studied by Fischer et al. stems 
from the possibility of failures; a computational node cannot tell whether another node is silent due to 
a failure or is simply taking a long time to react. Our non-convergence result concerns environments in 
which nodes/communication channels do not fail. Thus, each node is guaranteed that all other nodes will 
eventually react. Observe that in such an environment reaching a consensus is easy; one pre-specified node 
i (the “dictator”) waits until it learns all other nodes’ inputs (this is guaranteed to happen as failures are 
impossible) and then selects a value $v_i$ and informs all other nodes; then, all other nodes select $v_i$. By 
contrast, the possibility of non-convergence shown in Theorem 1 stems from limitations on nodes’ behaviors. 
Hence, there is no immediate translation from the result of Fischer et al. to ours (and vice versa). 


5 Applications: Games, Circuits, Social Networks, and Routing 

We present implications of our impossibility result, Theorem 1, for several well-studied environments: game 
dynamics, circuit design, social networks, and Internet protocols. For most of these applications, the reaction 
functions are historyless. Recalling the definition in Section 3.2, this means that they react only to the current state of 
the system. 


5.1 Asynchronous Game Dynamics 

Traditionally, work in game theory on game dynamics (e.g., best-response dynamics) relies on the explicit or 
implicit premise that players’ behavior is somehow synchronized (in some contexts play is sequential, while 
in others it is simultaneous). Here, we consider the realistic scenario that there is no computational center 
that can synchronize players’ selection of strategies. We describe these dynamics in the setting of this work
and exhibit an impossibility result for best-response, and more general, dynamics. 

A game is characterized by a triple $(n, S, u)$. There are $n$ players, $1, \ldots, n$. Each player $i$ has a strategy
set $S_i$. $S = S_1 \times \cdots \times S_n$ is the space of strategy profiles $s = (s_1, \ldots, s_n)$. Each player $i$ has a utility function
$u_i : S \to \mathbb{R}$, where $u = (u_1, \ldots, u_n)$. Intuitively, player $i$ “prefers” states for which $u_i$ is higher. Informally, a
player is best responding when it has no incentive to unilaterally change its strategy.

Definition. In a game $\Gamma = (n, S, u)$, player $i$ is best responding at $s \in S$ if $u_i(s) \ge u_i(s')$ for every $s' \in S$
such that $s_{-i} = s'_{-i}$. We write $s_i \in BR_i^{\Gamma}(s)$. A strategy profile $s \in S$ is a pure Nash equilibrium (PNE) if
every player is best responding at $s$.
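
The definition can be checked by brute force on small games. The following is a minimal sketch, not part of the original construction: the function names and the two-player coordination game are hypothetical, chosen only to illustrate best responding and pure Nash equilibria.

```python
from itertools import product

def is_best_responding(utilities, strategy_sets, profile, i):
    """True if player i cannot raise its utility by deviating unilaterally."""
    current = utilities[i](profile)
    for alt in strategy_sets[i]:
        deviation = profile[:i] + (alt,) + profile[i + 1:]
        if utilities[i](deviation) > current:
            return False
    return True

def pure_nash_equilibria(utilities, strategy_sets):
    """Enumerate all profiles at which every player is best responding."""
    n = len(strategy_sets)
    return [s for s in product(*strategy_sets)
            if all(is_best_responding(utilities, strategy_sets, s, i)
                   for i in range(n))]

# Hypothetical coordination game: both players get 1 when they match, else 0.
coordination = [lambda s: int(s[0] == s[1]), lambda s: int(s[0] == s[1])]
equilibria = pure_nash_equilibria(coordination, [(0, 1), (0, 1)])
print(equilibria)  # [(0, 0), (1, 1)]
```

This toy game has two pure Nash equilibria, which is exactly the situation the non-convergence results of this section address.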

There is a natural relationship between games and the interaction systems described in Section 3. A
player with a strategy set corresponds directly to a node with an action space, and a strategy profile may be 
viewed as a state. These correspondences are so immediate that we often use these terms interchangeably. 

Consider the case of best-response dynamics for a game in which best responses are unique (a generic 
game): starting from some arbitrary strategy profile, each player chooses its unique best response to other 
players’ strategies when activated. Convergence to pure Nash equilibria under best-response dynamics is the 
subject of extensive research in game theory and economics, and both positive [381132] and negative Emu] 
results are known. If we view each player $i$ in a game $(n, S, u)$ as a node in an interaction system, then
under best-response dynamics its utility function $u_i$ induces a self-independent historyless reaction function
$f_i : S_{-i} \to S_i$, as long as best responses are unique. Formally,

\[ f_i(a_{-i}) = \arg\max_{\alpha \in S_i} u_i(a_1, \ldots, \alpha, \ldots, a_n). \]


Conversely, any system with historyless and self-independent reaction functions can be described as 
following best-response dynamics for a game with unique best responses. Given reaction functions $f_1, \ldots, f_n$,
consider the game where each player $i$'s utility function is given by

\[ u_i(a) = \begin{cases} 1 & \text{if } f_i(a) = a_i \\ 0 & \text{otherwise.} \end{cases} \]


Best-response dynamics on this game replicate the dynamics induced by those reaction functions. Thus 
historyless and self-independent dynamics are exactly equivalent to best-response dynamics. Since pure 
Nash equilibria are fixed points of these dynamics, the historyless case of Theorem 1 may be restated in the
following form. 

Theorem 3. If there are two or more pure Nash equilibria in a game with unique best responses, then 
asynchronous best-response dynamics can potentially oscillate indefinitely. 
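
A small simulation illustrates the phenomenon. This is a sketch under stated assumptions: the two-player coordination game, the function names, and the two schedules are hypothetical examples, not taken from the proof.

```python
def best_response(i, state):
    """Unique best response in a hypothetical 2-player coordination game:
    each player wants to match the other player's current strategy."""
    return state[1 - i]

def run(state, schedule):
    """Apply a schedule (a list of sets of activated players) to a start state."""
    trajectory = [state]
    for active in schedule:
        # Activated players react simultaneously to the same current state.
        state = tuple(best_response(i, state) if i in active else state[i]
                      for i in range(2))
        trajectory.append(state)
    return trajectory

# Activating both players in every step is fair, yet play never settles.
simultaneous = run((0, 1), [{0, 1}] * 4)
# Activating one player at a time reaches a pure Nash equilibrium.
alternating = run((0, 1), [{0}, {1}] * 2)
print(simultaneous)  # [(0, 1), (1, 0), (0, 1), (1, 0), (0, 1)]
print(alternating)   # [(0, 1), (1, 1), (1, 1), (1, 1), (1, 1)]
```

The game has the two equilibria (0, 0) and (1, 1); whether play reaches one of them depends entirely on the schedule.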

In fact, best-response dynamics are just one way to derive reaction functions from utility functions, i.e., 
to translate preferences into behaviors. In general, a game dynamics protocol is a mapping from games to 
systems that makes this translation. Given a game (n, S, u) as input, the protocol selects reaction functions 
$f = (f_1, \ldots, f_n)$, and returns an interaction system $(n, S, f)$. The above non-convergence result holds for a
large class of these protocols. In particular, it holds for bounded-recall and self-independent game dynamics,
whenever pure Nash equilibria are limit states. When cast into game-theoretic terminology, Theorem 1
says that if players’ choices of strategies are not synchronized, then the existence of two (or more) pure 
Nash equilibria implies that this broad class of game dynamics are not guaranteed to reach a pure Nash 
equilibrium. This result should be contrasted with positive results for such dynamics in the traditional 
synchronous game-theoretic environments. In particular, this result applies to best-response dynamics with 
bounded recall and consistent tie-breaking rules (studied by Zapechelnyuk [43]). 

Theorem 4. If there are two or more pure Nash equilibria in a game with unique best responses, then all 
bounded-recall self-independent dynamics for which those equilibria are fixed points can fail to converge in 
asynchronous environments. 

5.2 Asynchronous Circuits 

The implications of asynchrony for circuit design have been extensively studied in computer architecture 
research [7]. By regarding each logic gate as a node executing an inherently historyless and self-independent
reaction function, we show that an impossibility result for stabilization of asynchronous circuits follows from
Theorem 1.

In this setting there is a Boolean circuit, represented as a directed graph G, in which the vertices represent 
the circuit’s inputs and logic gates, and the edges represent the circuit’s connections. The activation of the 
logic gates is asynchronous. That is, the gates’ outputs are initialized in some arbitrary way, and then 
the update of each gate’s output, given its inputs, is uncoordinated and unsynchronized. A stable Boolean 
assignment in this framework is an assignment of Boolean values to the circuit inputs and the logic gates 
that is consistent with each gate’s truth table. We say that a Boolean circuit is inherently stable if it is 
guaranteed to converge to a stable Boolean assignment regardless of the initial Boolean assignment. 

To show how Theorem 1 applies to this setting, we model an asynchronous circuit with a fixed input as a
historyless interaction system with self-independent reaction functions. Every node in the system has action 
space {0,1}. There is a node for each input vertex, which has a constant (and thus self-independent) reaction 
function. For each logic gate, the system includes a node whose reaction function implements the logic gate 
on the actions of the nodes corresponding to its inputs. If any gate takes its own output directly as an input, 
we model this using an additional identity gate; this means that all reaction functions are self-independent. 
Since every stable Boolean assignment corresponds to a limit state of this system, the following instability result
follows from Theorem 1.

Theorem 5. If two or more stable Boolean assignments exist for an asynchronous Boolean circuit with a 
given input, then that asynchronous circuit is not inherently stable on that input. 
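
As an illustration, consider a cross-coupled NOR latch with both inputs held at 0. This is a hypothetical example chosen for this sketch, not a circuit discussed above; the function and variable names are likewise assumptions.

```python
def nor(a, b):
    return int(not (a or b))

def step(state, active, s=0, r=0):
    """Update the activated gates of a cross-coupled NOR latch.
    state = (q, q_bar); s and r are the fixed circuit inputs."""
    q, q_bar = state
    new_q = nor(r, q_bar) if "q" in active else q
    new_q_bar = nor(s, q) if "q_bar" in active else q_bar
    return (new_q, new_q_bar)

def is_stable(state, s=0, r=0):
    """A stable assignment is unchanged when every gate is re-evaluated."""
    return step(state, {"q", "q_bar"}, s, r) == state

stable = [(q, qb) for q in (0, 1) for qb in (0, 1) if is_stable((q, qb))]

# Both gates update simultaneously from the initial assignment (0, 0).
state = (0, 0)
trace = [state]
for _ in range(4):
    state = step(state, {"q", "q_bar"})
    trace.append(state)
print(stable)  # [(0, 1), (1, 0)]
print(trace)   # [(0, 0), (1, 1), (0, 0), (1, 1), (0, 0)]
```

With two stable assignments on this input, the latch is not inherently stable: the simultaneous schedule alternates between (0, 0) and (1, 1) forever.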

5.3 Diffusion of Technologies in Social Networks 

Understanding the ways in which innovations, ideas, technologies, and practices disseminate through social 
networks is fundamental to the social sciences. We consider the classic economic setting [33] (which has
lately also been approached by computer scientists [24]), where each decision maker has two technologies
{X,Y} to choose from and wishes to have the same technology as the majority of its “friends” (neighboring 
nodes in the social network). We exhibit a general asynchronous instability result for this environment. 

In this setting there is a social network of users, represented by a connected graph in which users are 
the vertices and edges correspond to friendship relationships. There are two competing technologies, X and 
Y. Each user will repeatedly reassess his choice of technology, at timesteps separated by arbitrary finite
intervals. When this happens, the user will select X if at least half of his friends are using X and otherwise
select Y. A “stable global state” is a fixed point of these choice functions, meaning that no user will ever
again switch technologies. Observe that if every user has chosen X or every user has chosen Y, then the
system is in a stable global state.

The dynamics of this diffusion can be described as asynchronous best-response dynamics for the game in 
which each player’s utility is 1 if his choice of technology is consistent with the majority (with ties broken 
in favor of X) and 0 otherwise. This game has unique best responses, and the strategy profiles $(X, \ldots, X)$
and $(Y, \ldots, Y)$ are both pure Nash equilibria for this game. Thus Theorem 3 implies the following result.

Theorem 6. In every social network with at least one edge, the diffusion of technologies can potentially 
oscillate indefinitely. 
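
The oscillation can be reproduced on a small network. The sketch below assumes a hypothetical 4-cycle of users and a schedule that activates everyone in every step; the names `choose`, `step`, and `friends` are illustrative only.

```python
def choose(user, state, friends):
    """Select 'X' if at least half of the user's friends use 'X', else 'Y'."""
    using_x = sum(1 for f in friends[user] if state[f] == "X")
    return "X" if 2 * using_x >= len(friends[user]) else "Y"

def step(state, friends, active):
    """Activated users reassess simultaneously, all reading the same old state."""
    return {u: choose(u, state, friends) if u in active else t
            for u, t in state.items()}

# Hypothetical social network: a cycle on four users.
friends = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}
state = {0: "X", 1: "Y", 2: "X", 3: "Y"}
history = ["".join(state[u] for u in sorted(state))]
for _ in range(4):
    state = step(state, friends, active={0, 1, 2, 3})
    history.append("".join(state[u] for u in sorted(state)))
print(history)  # ['XYXY', 'YXYX', 'XYXY', 'YXYX', 'XYXY']
```

Starting from all X or all Y, the same `step` function leaves the state unchanged, matching the two stable global states noted above.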

5.4 Interdomain Routing 

Interdomain routing is the task of establishing routes between the smaller networks, or autonomous systems 
(ASes), that make up the Internet. It is handled by the Border Gateway Protocol (BGP). We abstract a 
recent result of Sami et al. [40] concerning BGP non-convergence and show that this result extends to several 
BGP-based multipath routing protocols that have been proposed in the past few years. 

In the standard model for analyzing BGP dynamics there is a network of source ASes that wish 
to send traffic to a unique destination AS $d$. Each AS $i$ has a ranking function $<_i$ that specifies $i$'s strict
preferences over all simple (loop-free) routes leading from i to d. Each AS also has an export policy that 
specifies which routes it is willing to make available to each neighboring AS. Under BGP, each AS constantly 
selects the “best” route that is available to it (see for more details). BGP safety, i.e., guaranteed 
convergence to a stable routing outcome, is a fundamental desideratum that has been the subject of extensive 
work in both the networking and the standards communities. We now cast interdomain routing into the 
terminology of Section 3 to obtain a non-termination result for BGP as a corollary of Theorem 1.

Each AS acts as a computational node. The action space of each node i is the set of all simple routes 
from $i$ to the destination $d$, together with the empty route $\emptyset$. For every state $(R_1, \ldots, R_n)$ of the system, $i$'s
reaction function $f_i$ considers the set of routes $S = \{(i, j)R_j : j \text{ is } i\text{'s neighbor and } R_j \text{ is exportable to } i\}$.
If $S$ is empty, $f_i$ returns $\emptyset$. Otherwise, $f_i$ selects the route in $S$ that is optimal with respect to $<_i$. Observe
that this reaction function is deterministic, self-independent, and historyless, and that a stable routing tree
is a limit state of this system. Thus Theorem 1 implies the following result of Sami et al.

Theorem 7 (Sami et al. [40]). If there are multiple stable routing trees in a network, then BGP is not safe
on that network. 
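
A two-AS network suffices to see this behavior. The sketch below is an assumption-laden illustration: the rankings (each AS prefers the route through its neighbor to its direct route), the permissive export policy, and all names are hypothetical, and the reaction function follows the description above only informally.

```python
D = "d"
# Hypothetical rankings, listed best first: each AS prefers the route
# through its neighbor to its own direct route to the destination.
ranking = {
    1: [(1, 2, D), (1, D)],
    2: [(2, 1, D), (2, D)],
}
neighbors = {1: [2, D], 2: [1, D]}

def react(i, state):
    """Pick i's most preferred route among extensions of neighbors' routes.
    Assumes every route is exportable to every neighbor."""
    available = set()
    for j in neighbors[i]:
        route_j = (D,) if j == D else state[j]
        if route_j and i not in route_j:  # keep routes simple (loop-free)
            available.add((i,) + route_j)
    for route in ranking[i]:
        if route in available:
            return route
    return ()  # the empty route

def step(state, active):
    """Activated ASes react simultaneously to the same current state."""
    return {i: react(i, state) if i in active else r for i, r in state.items()}

state = {1: (1, D), 2: (2, D)}
trace = [state]
for _ in range(4):
    state = step(state, {1, 2})
    trace.append(state)

stable_a = {1: (1, 2, D), 2: (2, D)}
stable_b = {1: (1, D), 2: (2, 1, D)}
print(trace[1])  # {1: (1, 2, 'd'), 2: (2, 1, 'd')}
```

Both `stable_a` and `stable_b` are fixed points of `step`, yet the schedule that activates both ASes in every step alternates between the direct routes and the mutually dependent routes.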

Importantly, the asynchronous model of Section 3 is significantly more restrictive than that of Sami et
al., so the result implied by Theorem 1 is stronger. Theorem 1 implies an even more general non-safety
result for routing protocols that depend on ASes that act self-independently and with bounded recall. In
particular, this includes recent proposals for BGP-based multi-path routing protocols that allow each AS to
send traffic along multiple routes, e.g., R-BGP [25] and Neighbor-Specific BGP (NS-BGP) [41].

5.5 Congestion Control 

We now consider the fundamental task of congestion control in communication networks, which is achieved 
through a combination of mechanisms on end-hosts (e.g., TCP) and on switches/routers (e.g., RED and
WFQ). We briefly describe the model of congestion control studied by Godfrey et al. [16].

There is a network of routers, represented by a directed graph $G$, in which vertices represent routers,
and edges represent communication links. Each edge $e$ has capacity $c_e$. There are $n$ source-target pairs of
vertices $(s_i, t_i)$, termed “connections”, that represent communicating pairs of end-hosts. Each source-target
pair $(s_i, t_i)$ is connected via some fixed route, $R_i$. Each source $s_i$ transmits at a constant rate $\gamma_i > 0$.
(This is modeled via the addition of an edge $e = (u, s_i)$ to $G$, such that $c_e = \gamma_i$ and $u$ has no incoming edges.)
For each of a router’s outgoing edges, the router has a queue management, or queuing, policy that dictates
how the edge’s capacity should be allocated between the connections whose routes traverse that edge. The
network is asynchronous, so routers’ queuing decisions can be made simultaneously. An equilibrium of the
network is a global configuration of edges’ capacity allocation such that the incoming and outgoing flows
on each edge are consistent with the queuing policy for that edge. Godfrey et al. show that, while one
might expect flow to be received at a constant rate whenever it is transmitted at a constant rate, this is
not necessarily the case. Indeed, Godfrey et al. present examples in which connections’ throughputs can
potentially fluctuate ad infinitum, never converging to an equilibrium.

We model such a network as a historyless interaction system to show that every network with multiple 
equilibria can oscillate indefinitely. The computational nodes of the system are the edges. The action space of 
each edge e intuitively consists of all possible ways to divide traffic going through e between the connections 
whose routes traverse $e$. More formally, for every edge $e$, let $N(e)$ be the number of connections whose paths
go through $e$. Then $e$'s action space is $A_e = \{(x_1, \ldots, x_{N(e)}) : \text{each } x_i \ge 0 \text{ and } \sum_i x_i \le c_e\}$. We assume
that the $x_i$ have bounded precision, meaning that the state space is finite.

Edge $e$'s reaction function $f_e$ models the queuing policy according to which $e$'s capacity is shared: for
every $N(e)$-tuple of nonnegative incoming flows $(w_1, \ldots, w_{N(e)})$, $f_e$ outputs an action $(x_1, \ldots, x_{N(e)}) \in A_e$
such that $x_i \le w_i$ for every $i \in [N(e)]$; that is, a connection’s flow leaving the edge cannot exceed its flow entering
the edge. These reaction functions are historyless and self-independent, and an equilibrium of the network
is a limit state of this system. Using Theorem 1, then, we can obtain the following impossibility result.

Theorem 8. If there are multiple capacity-allocation equilibria in the network, then dynamics of congestion 
control can potentially oscillate indefinitely. 


6 Complexity of Asynchronous Dynamics 

We now turn to the communication complexity and computational complexity of determining whether a 
system is convergent. We present hardness results in both models of computation even for the case of 
historyless interaction. Our computational complexity result shows that even if nodes’ reaction functions 
can be succinctly represented, determining whether the system is convergent is PSPACE-complete. Alongside 
its computational implications, this intractability result implies that (unless PSPACE $\subseteq$ NP) we cannot hope
to have short, efficiently verifiable certificates that guarantee a system’s convergence. 

6.1 Communication Complexity 

The following result shows that, in general, determining whether a system is convergent cannot be done 
efficiently. 

Theorem 9. Determining if a system with $n$ nodes, each with 2 actions, is convergent requires communicating
$\Omega(2^n)$ bits. This holds even if all nodes have historyless and self-independent reaction functions.

Proof. To prove our result we present a reduction from the 2-party set disjointness problem, a well-known 
problem in communication complexity theory: There are two parties, Alice and Bob. Each party holds a 
subset of $[q]$; Alice holds the subset $A$ and Bob holds the subset $B$. The objective is to determine whether
$A \cap B = \emptyset$. This problem instance is denoted $\mathrm{Disj}^q(A, B)$. The following is well known.

Theorem 10. Determining whether $A, B \subseteq [q]$ are disjoint requires (in the worst case) the communication
of $\Omega(q)$ bits. This lower bound applies to randomized protocols with bounded 2-sided error and also to
nondeterministic protocols. 

We now present a reduction from 2-party set disjointness to the question of determining whether a system 
with historyless and self-independent reaction functions is convergent. Given an instance $\mathrm{Disj}^q(A, B)$, we
construct a system with $n$ nodes, each with two actions, as follows. (The relation between the parameters
$q$ and $n$ is to be specified later.) Let the action space of each node be $\{0, 1\}$. We now define the reaction
functions of the nodes. Consider the possible action profiles of nodes $3, \ldots, n$, i.e., the set $\{0, 1\}^{n-2}$. Observe
that this set of action profiles is the $(n-2)$-hypercube $Q_{n-2}$, and thus can be visualized as the graph whose
vertices are indexed by the binary (n — 2)-tuples and such that two vertices are adjacent if and only if they 
differ in exactly one coordinate. The reaction functions are based on following a snake in this hypercube. 

Definition. A snake in a hypercube $Q_n$ is a simple cycle $S = (v_0, \ldots, v_k)$ that is chordless, i.e., for each
$v_i, v_j$ on $S$, if $v_i$ and $v_j$ are neighbors in $Q_n$, then $v_j \in \{v_{i-1}, v_{i+1}\}$.
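
The chordless condition is easy to test mechanically. The following sketch (the function names and the two example cycles in the 3-hypercube are illustrative assumptions) checks that consecutive vertices on the cycle are exactly the pairs that are adjacent in the hypercube.

```python
def hamming(u, v):
    """Number of coordinates in which two 0/1 tuples differ."""
    return sum(a != b for a, b in zip(u, v))

def is_snake(cycle):
    """Check that `cycle` (a list of 0/1 tuples) is a chordless simple cycle."""
    k = len(cycle)
    if k < 4 or len(set(cycle)) != k:
        return False  # hypercube cycles have length >= 4; no repeated vertices
    for a in range(k):
        for b in range(a + 1, k):
            consecutive = (b - a == 1) or (a == 0 and b == k - 1)
            adjacent = hamming(cycle[a], cycle[b]) == 1
            if consecutive != adjacent:
                return False  # a missing cycle edge, or a chord
    return True

# A 6-cycle in the 3-hypercube with no chords.
snake_q3 = [(0, 0, 0), (0, 0, 1), (0, 1, 1), (1, 1, 1), (1, 1, 0), (1, 0, 0)]
# A 6-cycle with a chord between (0, 0, 0) and (0, 1, 0).
not_snake = [(0, 0, 0), (0, 0, 1), (0, 1, 1), (0, 1, 0), (1, 1, 0), (1, 0, 0)]
print(is_snake(snake_q3), is_snake(not_snake))  # True False
```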


Let $S$ be a maximal snake in $Q_{n-2}$, and let $q = |S|$. We now show our reduction from $\mathrm{Disj}^q$. We identify
each element $j \in [q]$ with a unique vertex $v^j \in S$. Without loss of generality, we assume that $0^{n-2}$ is on
$S$. For ease of exposition we also assume that $1^{n-2}$ is not on $S$. (Getting rid of this assumption is easy.)
Orient the edges in $S$ to form a cycle. For any edge that has exactly one endpoint in $S$, orient the edge
toward $S$. An example is given in Fig. 4. Orient all other edges arbitrarily. For each $i = 3, \ldots, n$, this
orientation induces a function $g_i : Q_{n-2} \to \{0, 1\}$, where $g_i(a_3, \ldots, a_n)$ is determined by the direction of the
edge $\{(a_3, \ldots, a_{i-1}, 0, a_{i+1}, \ldots, a_n), (a_3, \ldots, a_{i-1}, 1, a_{i+1}, \ldots, a_n)\}$. The nodes’ self-independent and historyless
reaction functions are as follows.


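\[ f_1(a_1, \ldots, a_n) = \begin{cases} 0 & \text{if } a_2 = 1 \text{ and } (a_3, \ldots, a_n) = v^j \text{ for some } j \in A \\ 1 & \text{otherwise} \end{cases} \]

\[ f_2(a_1, \ldots, a_n) = \begin{cases} 0 & \text{if } a_1 = 1 \text{ and } (a_3, \ldots, a_n) = v^j \text{ for some } j \in B \\ 1 & \text{otherwise} \end{cases} \]

\[ f_i(a_1, \ldots, a_n) = \begin{cases} g_i(a_3, \ldots, a_n) & \text{if } a_1 = a_2 = 0 \\ 1 & \text{otherwise} \end{cases} \qquad (i = 3, \ldots, n) \]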


Informally, the aim is for nodes $3, \ldots, n$ to follow the snake $S$ to vertices corresponding to each value $j \in [q]$.



Figure 4: An acceptable orientation of the edges on $Q_4$. The solid edges form a cycle on a maximal snake
S, and no edge is directed away from S. 

Observation 11. $1^n$ is the unique limit state of the system.

In our reduction Alice simulates node 1 (whose reaction function is based on $A$), Bob simulates node 2
(whose reaction function is based on B), and one of the two parties simulates all other nodes (whose reaction 
functions are based on neither A nor B). The theorem now follows from the combination of the following 
two claims. 

Claim 12. In an oscillation there must be infinitely many time steps in which both node 1 and 2’s actions
are 0.

Proof. Suppose that from some moment forth it is never the case that both node 1 and 2’s actions are 0. 
Observe that from that time onwards the nodes $3, \ldots, n$ will always choose the action 1 when activated.
Hence, after some time has passed the actions of all nodes in $\{3, \ldots, n\}$ will be 1. Observe that whenever
nodes 1 and 2 are activated thereafter they shall choose the action 1, so we have convergence to the limit
state $1^n$. □

Claim 13. The system is convergent iff $A \cap B = \emptyset$.

Proof. If $A \cap B \ne \emptyset$, initialize the system to a state $(0, 0, a_3, \ldots, a_n)$, where $(a_3, \ldots, a_n)$ is a vertex on $S$. Consider a
schedule that activates $\{3, \ldots, n\}$ in every timestep until $(a_3, \ldots, a_n) = v^j$ for some $j \in A \cap B$. When that
happens, the schedule activates $\{1, 2\}$ for two consecutive timesteps, then resumes activating $\{3, \ldots, n\}$. The
functions $g_i$ ensure that, for each $j \in [q]$, the vector $(a_3, \ldots, a_n)$ will be equal to $v^j$ within a finite number of
timesteps. Since there is some $j \in A \cap B$, nodes 1 and 2 will eventually be activated, so this schedule is fair.
This initial state and schedule clearly produce an oscillation, so the system is not convergent in this case.

Now assume for contradiction that $A \cap B = \emptyset$ and the system is not convergent. We know from Claim 12
that if there is an oscillation, then there are infinitely many time steps in which both node 1 and 2’s actions
are 0. We argue that this implies that there must be infinitely many time steps in which both nodes select
action 0 simultaneously. Indeed, node 1 only chooses action 0 if node 2’s action is 1, and vice versa, and so
if both nodes never choose 0 simultaneously, then it is never the case that both nodes’ actions are 0 at the
same time step, a contradiction. Now, when is it possible for both 1 and 2 to choose 0 at the same time?
Observe that this can only be if the actions of nodes $3, \ldots, n$ constitute an element that is in both $A$ and $B$.
Hence, $A \cap B \ne \emptyset$, another contradiction. □

We have reduced $\mathrm{Disj}^q(A, B)$, which requires $\Omega(q)$ bits of communication, to the problem of checking
convergence of an $n$-node system with historyless and self-independent reaction functions. As $q$ was defined
as the length of a maximal snake in $Q_{n-2}$, a classical combinatorial result due to Evdokimov shows that
$q = \Omega(2^n)$.

Theorem 14 (Evdokimov [12]). Let $z \in \mathbb{Z}_+$ be sufficiently large. Then, the size $|S|$ of a maximal snake in
the $z$-hypercube is at least $\lambda 2^z$ for some $\lambda > 0$.

This completes the proof of Theorem 9. □

6.2 Computational Complexity 

The above communication complexity hardness result required the representation of the reaction functions 
to (potentially) be exponentially long. What if the reaction functions can be succinctly described? We now 
present a strong computational complexity hardness result for the case that each reaction function fi is 
historyless, and is given explicitly in the form of a Boolean circuit.

Theorem 15. When the reaction functions are given as Boolean circuits, determining whether a historyless 
system with n nodes is convergent is PSPACE-complete. 

Proof. Our proof is based on the proof of Fabrikant and Papadimitriou [13] that checking BGP safety is
PSPACE-complete. Importantly, that result does not imply Theorem 15, since the model of asynchronous
interaction considered in that work does not allow for simultaneous activation of nodes. We prove our 
theorem by reduction from the problem of determining whether a linear space-bounded Turing machine 
(TM) will halt from every starting configuration. 

For $n \in \mathbb{N}$, let $M$ be a TM that can only access the first $n$ tape cells. Let $Q$ be $M$'s machine state
space, $\Gamma$ its tape alphabet, and $\delta : Q \times \Gamma \to Q \times \Gamma \times \{-1, 0, 1\}$ its transition function. A configuration of $M$
is a triple $(q, \alpha, j)$, where $q \in Q$ is a machine state, $\alpha \in \Gamma^n$ describes the tape contents, and $j \in [n]$ gives the
location of the control head.

Then we say that $(M, 1^n)$ is in SHC (for space-bounded halting from all configurations) if, for every
configuration, $M$ will halt if it begins its computation from that configuration. As Fabrikant and Papadimitriou
[13] argue, the problem of checking whether a space-bounded TM will accept a blank input is reducible
to SHC, and thus SHC is PSPACE-hard.

We now reduce SHC to the problem of determining whether a historyless interaction system is convergent. 
Given $n \in \mathbb{N}$ and a TM $M$ that can only access the first $n$ tape cells, we construct a historyless system
that is convergent if and only if $(M, 1^n)$ is in SHC. The system has a cell node for each of the first $n$ tape
cells and a head node to represent the control head. The cell nodes $1, \ldots, n$ each have the action space $\Gamma$,
and the head node $n + 1$ has action space $Q \times \Gamma \times [n] \times \{-1, 0, 1\}$. For $\alpha \in \Gamma^n$, each cell node $i \in [n]$ has the
reaction function

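\[ f_i(\alpha, (q, \gamma, j, d)) = \begin{cases} \gamma & \text{if } j = i \\ \alpha_i & \text{otherwise.} \end{cases} \]

For the head node, $f_{n+1}(\alpha, (q, \gamma, j, d))$ is given by the following procedure.

    if α_j = γ then
        if j + d ∈ [n] then
            j ← j + d;
        (q, γ, d) ← δ(q, α_j);
    return (q, γ, j, d);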

Observe that $(\alpha, (q, \gamma, j, d))$ is a limit state of this system if and only if $q$ is a halting machine state for $M$.
Suppose the system is convergent, and let $(q, \alpha, j)$ be a configuration of $M$. Consider the system trajectory
that begins at state $(\alpha, (q, \alpha_j, j, 0))$ and follows the schedule that activates every node in every round. This
trajectory will reach a limit state, and it corresponds directly to a halting run of $M$ that begins from $(q, \alpha, j)$,
so $(M, 1^n)$ is in SHC.

Conversely, suppose that $(M, 1^n)$ is in SHC, and let $(\alpha, (q, \gamma, j, d))$ be a system state. Suppose that $\alpha_j = \gamma$.
Then consider the computation by $M$ that begins at configuration $(q, \alpha, j')$, where $j' = \min\{\max\{j + d, 1\}, n\}$.
This computation will reach a halting machine state, and any fair trajectory from $(\alpha, (q, \gamma, j, d))$ will go
through the same sequence of machine states as this computation, which means that the trajectory will
converge to a limit state. If $\alpha_j \ne \gamma$, then the system state will remain the same until cell node $j$ is activated and
takes action $\gamma$, after which the above argument applies. Thus the system is convergent. We conclude that
checking whether the system is convergent is PSPACE-hard. Since it is straightforward to check convergence 
in polynomial space, this problem is PSPACE-complete. □ 
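
To make the construction concrete, the sketch below runs the cell and head reaction functions under the schedule that activates every node in every round. It rests on several assumptions that are not in the proof: the toy transition function `delta`, the 3-cell tape, 0-indexed cells (the text uses 1-indexed cells), and the convention that a halting state rewrites the symbol it reads and does not move.

```python
def delta(q, symbol):
    """Hypothetical transition function: overwrite 0s with 1s while moving
    right, and halt on reading a 1."""
    if q == "scan" and symbol == 0:
        return ("scan", 1, +1)
    return ("halt", symbol, 0)  # halting: rewrite what was read, stay put

def cell_reaction(i, alpha, head):
    """Cell i records gamma when the head is at i; otherwise it keeps its symbol."""
    q, gamma, j, d = head
    return gamma if j == i else alpha[i]

def head_reaction(alpha, head):
    q, gamma, j, d = head
    if alpha[j] == gamma:              # wait until cell j has recorded gamma
        if 0 <= j + d < len(alpha):    # move only if the head stays on the tape
            j = j + d
        q, gamma, d = delta(q, alpha[j])
    return (q, gamma, j, d)

def synchronous_step(alpha, head):
    """Activate every node; all of them react to the same current state."""
    new_alpha = tuple(cell_reaction(i, alpha, head) for i in range(len(alpha)))
    return new_alpha, head_reaction(alpha, head)

# Start from tape 000 in state "scan" at cell 0, i.e. (alpha, (q, alpha_j, j, 0)).
alpha, head = (0, 0, 0), ("scan", 0, 0, 0)
steps = 0
while synchronous_step(alpha, head) != (alpha, head) and steps < 100:
    alpha, head = synchronous_step(alpha, head)
    steps += 1
print(alpha, head)
```

For this toy machine the trajectory reaches a fixed point whose head component is in the halting state, mirroring the correspondence between limit states and halting machine states used in the proof.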

In a preliminary version of this work [^, we conjectured that the above PSPACE-completeness result 
also holds for the case of self-independent reaction functions. 

Conjecture 16. Determining whether a system with n nodes, each with a deterministic self-independent 
and historyless reaction function, is convergent is PSPACE-complete. 

This conjecture has since been proved by Engelberg et al. m- 


7 Conclusions and Future Research 

In this paper, we have taken the first steps towards a complete understanding of strategic dynamics in 
distributed settings. We proved a general non-convergence result and several hardness results within this 
model. We also discussed some important aspects such as the implications of fairness and randomness, as 
well as applications to a variety of settings. We believe that we have only scratched the surface in the 
exploration of the convergence properties of game dynamics in distributed computational environments, 
and many important questions remain wide open. We now outline several interesting directions for future 
research. 

Other limitations, convergence notions, and equilibria. We have considered particular limitations on 
reaction functions, modes of convergence, and kinds of equilibria. Understanding the effects of asynchrony 
on different classes of reaction functions (e.g., uncoupled dynamics, dynamics with outdated information) 
and for other types of convergence (e.g., of the empirical distributions of play) and equilibria (e.g., mixed 
Nash equilibria, correlated equilibria) is a broad and challenging direction for future research. 

Other notions of asynchrony. We believe that better understanding the role of degrees of fairness, randomness,
and other restrictions on schedules from distributed computing literature, in achieving convergence
to equilibrium points is an interesting and important research direction. 

Characterizing asynchronous convergence. We still lack characterizations of asynchronous convergence 
even for simple dynamics (e.g., deterministic and historyless). Our PSPACE-completeness result in Section 6
eliminates the possibility of short witnesses of guaranteed asynchronous convergence unless PSPACE $\subseteq$ NP,
but elegant characterizations are still possible. 




Topological and knowledge-based approaches. Topological [H [531ISH] and knowledge-based [H] approaches
have been very successful in addressing fundamental questions in distributed computing. Can these
approaches shed new light on the implications of asynchrony for strategic dynamics? 

Further exploring the environments of Section 5. We have applied our main non-convergence result
to the environments described in Section 5. These environments are of independent interest and are indeed
the subject of extensive research. Hence, the further exploration of dynamics in these settings is important. 